Unicode, ucs-2, ucs-4, UTF-16, utf-32, UTF-8
Unicode details
Copyright Notice: It can be reproduced at will, but the original author charlee and original link http://tech.idv2.com/2008/02/21/unicode-intro/must be indicated in a timely
Reference Address: http://www.cnblogs.com/kingcat/archive/2012/10/16/2726334.htmlIn Java, char types describe a unit of code with UTF-16 encodingWhy Unicode is requiredWe know that the computer is actually very stupid, it only know 0101 such a
Unicode
We know that the computer is actually stupid. It only knows strings like 0101. Of course, we certainly feel dizzy when we look at the 01 string, so many times we simply use decimal to describe it, hexadecimal format, which is octal. in fact,
Why Unicode is requiredWe know that the computer is actually very stupid, it only know 0101 such a string, of course, we look at such a 01 string when it will be more dizzy, so many times in order to describe the simple are in decimal, hexadecimal,
Original address:http://www.cnblogs.com/kingcat/archive/2012/10/16/2726334.html Why Unicode is requiredWe know that the computer is actually very stupid, it only know 0101 such a string, of course, we look at such a 01 string when it will be more
Basics of UTF-8 Character Set
Brief character set history
Among all character sets, the most well-known number is the 7-bit ASCII character set. It is short for American Standards Committee for information interchange. It is designed for American
Concept Let's talk about the basic concept, which includes what is Unicode, What Is UTF-8, and what is UTF-16. For a complete description of Unicode, UTF-8, and UTF-16, see Wiki (UNICODE, UTF-8, UTF-16 ). In simple terms, Unicode defines all the
UTF-8 is a unicode implementation method, that is, its byte structure has special requirements, so we say a Chinese character range is 0x4e00 to 0x9fa5, refers to the Unicode value, as for UTF-8 encoding, it is organized by three bytes, so we can
UTF-8 is a unicode implementation method, that is, its byte structure has special requirements, so we say a Chinese character range is 0x4e00 to 0x9fa5, refers to the Unicode value, as for UTF-8 encoding, it is organized by three bytes, so we can
Turn: http://www.utf.com.cn/article/s1383
These related things are not complicated, but they are very easy to tell, especially recently I have read some of theseArticleEven if it is regarded as the source of authority, conflicts often occur,
The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion;
products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the
content of the page makes you feel confusing, please write us an email, we will handle the problem
within 5 days after receiving your email.
If you find any instances of plagiarism from the community, please send an email to:
info-contact@alibabacloud.com
and provide relevant evidence. A staff member will contact you within 5 working days.